Macro - Actions in Reinforcement Learning : An EmpiricalAnalysisAmy McGovern and Richard
نویسندگان
چکیده
Several researchers have proposed reinforcement learning methods that obtain advantages in learning by using temporally extended actions, or macro-actions, but none has carefully analyzed what these advantages are. In this paper, we separate and analyze two advantages of using macro-actions in reinforcement learning: the eeect on exploratory behavior, independent of learning, and the eeect on the speed with which the learning process propagates accurate value information. We empirically measure the separate contributions of these two eeects in gridworld and simulated robotic environments. In these environments, both eeects were signiicant, but the eeect of value propagation was larger. We also compare the accelerations of value propagation due to macro-actions and eligibility traces in the gridworld environment. Although eligibility traces increased the rate of convergence to the optimal value function compared to learning with macro-actions but without eligibility traces, eligibility traces did not permit the optimal policy to be learned as quickly as it was using macro-actions.
منابع مشابه
Macro Actions in Reinforcement Learning An Empirical Analysis
Several researchers have proposed reinforcement learning methods that obtain ad vantages in learning by using temporally extended actions or macro actions but none has carefully analyzed what these advantages are In this paper we separate and an alyze two advantages of using macro actions in reinforcement learning the e ect on exploratory behavior independent of learning and the e ect on the sp...
متن کاملRoles of Macro - Actions in Accelerating Reinforcement
We analyze the use of built-in policies, or macro-actions, as a form of domain knowledge that can improve the speed and scaling of reinforcement learning algorithms. Such macro-actions are often used in robotics, and macro-operators are also well-known as an aid to state-space search in AI systems. The macro-actions we consider are closed-loop policies with termination conditions. The macro-act...
متن کاملPlanning with Closed-Loop Macro Actions
Planning and learning at multiple levels of tempo ral abstraction is a key problem for arti cial intelli gence In this paper we summarize an approach to this problem based on the mathematical framework of Markov decision processes and reinforcement learn ing Conventional model based reinforcement learning uses primitive actions that last one time step and that can be modeled independently of th...
متن کاملLearning Options in Reinforcement Learning
Temporally extended actions (e.g., macro actions) have proven very useful in speeding up learning, ensuring robustness and building prior knowledge into AI systems. The options framework (Precup, 2000; Sutton, Precup & Singh, 1999) provides a natural way of incorporating such actions into reinforcement learning systems, but leaves open the issue of how good options might be identified. In this ...
متن کاملDescription and Acquirement of Macro-Actions in Reinforcement Learning
Reinforcement learning is a framing of enabling agents to learn from interaction with environments. It has focused generally on Markov decision process (MDP) domains, but a domain may be non-Markovian in the real world. In this paper, we develop a new description of macro-actions for non-Markov decision process (NMDP) domains in reinforcement learning. A macro-action is an action control struct...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998